DETAILED ACTION



1. In view of the Appeal Brief filed on May 21, 2008, PROSECUTION IS HEREBY
REOPENED. A new ground of rejection is set forth below. 

To avoid abandonment of the application, appellant must exercise one of the 
following two options: 

(1) file a reply under 37 CFR 1.111 (if this Office action is non-final) or a reply
under 37 CFR 1.113 (if this Office action is final); or, 

(2) initiate a new appeal by filing a notice of appeal under 37 CFR 41.31 followed
by an appeal brief under 37 CFR 41.37. The previously paid notice of appeal fee and
appeal brief fee can be applied to the new appeal. If, however, the appeal fees set forth
in 37 CFR 41.20 have been increased since they were previously paid, then appellant
must pay the difference between the increased fees and the amount previously paid. 

A Supervisory Patent Examiner (SPE) has approved of reopening prosecution by 
signing below: 

/James K. Trujillo/ 

Supervisory Patent Examiner, Art Unit 2169

Information Disclosure Statement

2. The information disclosure statement (IDS) submitted on 3/28/2008 was filed 
after the mailing date of the Advisory Action on February 19, 2008. The submission is 
in compliance with the provisions of 37 CFR 1.97. Accordingly, the information
disclosure statement is being considered by the examiner. 



Double Patenting 



3. The nonstatutory double patenting rejection is based on a judicially created 
doctrine grounded in public policy (a policy reflected in the statute) so as to prevent the 
unjustified or improper timewise extension of the "right to exclude" granted by a patent 
and to prevent possible harassment by multiple assignees. A nonstatutory 
obviousness-type double patenting rejection is appropriate where the conflicting claims 
are not identical, but at least one examined application claim is not patentably distinct 
from the reference claim(s) because the examined application claim is either anticipated 
by, or would have been obvious over, the reference claim(s). See, e.g., In re Berg, 140 
F.3d 1428, 46 USPQ2d 1226 (Fed. Cir. 1998); In re Goodman, 11 F.3d 1046, 29 
USPQ2d 2010 (Fed. Cir. 1993); In re Longi, 759 F.2d 887, 225 USPQ 645 (Fed. Cir. 
1985); In re Van Ornum, 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re Vogel, 422
F.2d 438, 164 USPQ 619 (CCPA 1970); and In re Thorington, 418 F.2d 528, 163
USPQ 644 (CCPA 1969). 

A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d)
may be used to overcome an actual or provisional rejection based on a nonstatutory 
double patenting ground provided the conflicting application or patent either is shown to 
be commonly owned with this application, or claims an invention made as a result of 
activities undertaken within the scope of a joint research agreement. 

Effective January 1, 1994, a registered attorney or agent of record may sign a
terminal disclaimer. A terminal disclaimer signed by the assignee must fully comply with 
37 CFR 3.73(b). 

4. Claims 1, 20, 39, 58 and 81-84 are provisionally rejected on the ground of
nonstatutory obviousness-type double patenting as being unpatentable over claims 1,
16 and 31-32 of copending Application No. 10/626,856. Although the conflicting claims are
not identical, they are not patentably distinct from each other because the claims use
determining steps that are clearly similar. For example, claim 1 of the instant
application recites "determining source-identified training stories", and claim 1 of
Application No. 10/626,856 recites "determining a source-identified story corpus,
each story associated with at least one event". In effect, both claims state the same
thing. The other steps in the remainder of the claims follow the same reasoning. This is a
provisional obviousness-type double patenting rejection because the conflicting claims
have not in fact been patented.

Claim Rejections - 35 USC § 103

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

6. Claims 1-5, 9-10, 14-24, 28-29, 33-43, 47-48, 52-62, 66-67, 71-76 and 81-88 are
rejected under 35 U.S.C. 103(a) as being unpatentable over Sundaresan et al. (US
Patent 6,606,620 B1) in view of Pirolli et al. (US 5,835,905) and further in view of
Maybury et al. (US 6,961,954 B1).

As per claim 1 Sundaresan et al. is directed to a computer-implemented method
of determining predictive models for a linked event detection system comprising the
steps of:

determining source-identified training stories (column 3, lines 16-17, wherein
"stories" means "documents");

determining link label information for the at least one story-pair (column 9, lines
8-9); and

determining and storing at least one predictive model in the memory based on
the inter-story similarity vectors and the link label information (column 10, lines 5-13).

Sundaresan et al. does not teach determining inter-story similarity vectors for at
least one story-pair.

Pirolli et al. teaches determining inter-story similarity vectors for at least one
story-pair (Pirolli et al., column 7, lines 53-65, wherein "pages" could mean "stories").

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include
determining inter-story similarity vectors for at least one story-pair because it provides
the similarity measure of documents.

Sundaresan et al. as modified still does not teach the link label information
indicating the existence of at least one link between a pair of stories in the
source-identified training stories and that the linked source-identified stories are
related to the same event.

Maybury et al. does teach the link label information indicating the existence of at
least one link between a pair of stories in the source-identified training stories and that
the linked source-identified stories are related to the same event (paragraph 16, lines
31-33).

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to combine the Sundaresan et al. as modified by teachings of
Maybury et al. to include the link label information indicating the existence of at least
one link between a pair of stories in the source-identified training stories and that the
linked source-identified stories are related to the same event because they indicate
related information (Maybury et al., paragraph 16, line 33).

As per claim 2 Sundaresan et al. as modified is directed to the step of determining
inter-story similarity vectors comprising the steps of: determining at least one inter-story
similarity metric for the story-pairs (Sundaresan et al., column 4, lines 9-25); and
determining at least one source-pair statistic for the at least one story-pair
(Sundaresan et al., column 10, lines 15-17).

As per claim 3 Sundaresan et al. as modified is directed to determining inter-story
similarity vectors further comprising the step of normalizing the inter-story similarity
metric based on the source-pair statistics (Sundaresan et al., column 10, lines 17-22).

As per claim 4 Sundaresan et al. as modified is directed to determining inter-story
similarity vectors further comprising the step of incrementally normalizing the inter-story
similarity metric based on the source-pair statistics (Sundaresan et al., column 10,
lines 16-22).

As per claim 5 Sundaresan et al. as modified is directed to the inter-story 
similarity metric is normalized based on at least one of subtraction and division 
(Sundaresan et al., column 8, lines 22-27). 

As per claim 9 Sundaresan et al. as modified is directed to comprising the
steps of transforming the source-identified training stories (Sundaresan et al., column 1, 
line 63, wherein the "training stories" are in English). 

As per claim 10 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and
linguistically transforming (Sundaresan et al., column 1, line 63; column 2, line 43,
wherein the HTML and XML are in English, therefore translation will not be necessary). 

As per claim 14 Sundaresan et al. as modified is directed to at least one inter- 
story similarity metric is normalized based on at least one of a source-pair identified 
similarity statistic (Sundaresan et al., column 10, lines 15-17). 

As per claim 15 Sundaresan et al. as modified is directed to at least one 
predictive model is at least one of: a classifier, a support vector machine, a decision tree 
and a Naive-Bayes classifier (Sundaresan et al., column 3, lines 13-14). 

As per claim 16 Sundaresan et al. as modified is directed to at least one of the
source-pair similarity statistics are determined based on a source hierarchy 
(Sundaresan et al., column 3, lines 50-51). 

As per claim 17 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one source characteristic (Sundaresan et al., column 3, 
lines 61-65, wherein "characteristic" means "leaf").

As per claim 18 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic
(Sundaresan et al., column 3, lines 54-60, wherein "language characteristic" means how
the words in a document are related to each other). 

As per claim 19 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristic of the new source (Sundaresan et al., column 3, lines 50-53, wherein
each new source has different hierarchy). 

As per claim 20 Sundaresan et al. is directed to a linked event detection training 
system comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have
memory); 

a processor that receives source-identified training stories and associated link 
label information for at least one story-pair via the input/output circuit (column 7, lines 
34-35, wherein it is inherent for computer to have a processor); 

and a predictive model determining circuit that determines and stores at least 
one predictive model based on the inter-story similarity vectors and the link label 
information (column 10, lines 5-13).

Sundaresan et al. does not teach an inter-story similarity vector determining
circuit that determines inter-story similarity vectors in memory for at least one
story-pair of the source-identified stories.

Pirolli et al. teaches an inter-story similarity vector determining circuit that 
determines inter-story similarity vectors in memory for at least one story-pair of the
source-identified stories (Pirolli et al., column 7, lines 53-65, wherein "pages" could 
mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include
an inter-story similarity vector determining circuit that determines inter-story similarity
vectors in memory for at least one story-pair of the source-identified stories because it
provides the similarity measure of documents. 

Sundaresan et al. as modified still does not teach the link label information 
indicating the existence of at least one link between a pair of stories in the source- 
identified training stories and that the linked source-identified stories are related to the 
same event. 

Maybury et al. does teach the link label information indicating the existence of at 
least one link between a pair of stories in the source-identified training stories and that 
the linked source-identified stories are related to the same event (paragraph 16, lines 
31-33).

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to combine the Sundaresan et al. as modified by teachings of 
Maybury et al. to include the link label information indicating the existence of at least 
one link between a pair of stories in the source-identified training stories and that the 
linked source-identified stories are related to the same event because they indicate 
related information (Maybury et al., paragraph 16, line 33). 

As per claim 21 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the at least one story-pair (Sundaresan et al., column 4, lines 9-25); 

and a similarity statistics determining circuit that determines at least one
source-pair statistic for the at least one story-pair (Sundaresan et al., column 10, lines 15-17).

As per claim 22 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit normalizes the inter-story similarity metric based on 
the source-pair statistics (Sundaresan et al., column 10, lines 17-22). 

As per claim 23 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit incrementally normalizes the inter-story similarity 
metric based on the source-pair statistics (Sundaresan et al., column 10, lines 16-22).

As per claim 24 Sundaresan et al. as modified is directed to at least one of the
inter-story similarity metrics is normalized based on at least one of a subtraction and a
division operation (Sundaresan et al., column 8, lines 22-27).

As per claim 28 Sundaresan et al. as modified is directed to comprising the
step of transforming the source-identified training stories (Sundaresan et al., column 1, 
line 63, wherein the "training stories" are in English). 

As per claim 29 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
linguistically transforming (Sundaresan et al., column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English therefore translation will not be necessary). 

As per claim 33 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic (Sundaresan et al., column 10, lines 15-17). 

As per claim 34 Sundaresan et al. as modified is directed to the at least one 
predictive model is at least one of: a classifier, a support vector machine, a decision tree 
and a Naive-Bayes classifier (Sundaresan et al., column 3, lines 13-14).

As per claim 35 Sundaresan et al. as modified is directed to the source-pair 
identified similarity statistic is determined based on a source hierarchy (Sundaresan et 
al., column 3, lines 50-51).

As per claim 36 Sundaresan et al. as modified is directed to the source hierarchy
is determined based on at least one of a source characteristic (Sundaresan et al.,
column 3, lines 61-65, wherein "characteristic" means "leaf").

As per claim 37 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
(Sundaresan et al., column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 

As per claim 38 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristic of the new source (Sundaresan et al., column 3, lines 50-53, wherein
each new source has different hierarchy). 

As per claim 39 Sundaresan et al. is directed to a computer-implemented method 
of linked event detection comprising the steps of: 

determining source-identified stories (column 3, lines 16-17, wherein "stories" 
means "documents"); 

determining at least one predictive model in the memory for link detection 
(column 10, lines 5-13);

determining a link between the story-pairs based on the predictive model
and the inter-story similarity vector (column 10, lines 5-13, wherein sorting determines
the link); and

displaying the link on a computer or storing the link in an information repository
(column 6, lines 57-59).

Sundaresan et al. does not teach determining inter-story similarity vectors in a
memory for the story-pairs of the source-identified stories.

Pirolli et al. teaches determining inter-story similarity vectors in a memory for the 
story-pairs of the source-identified stories (Pirolli et al., column 7, lines 53-65, wherein 
"pages" could mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include
determining inter-story similarity vectors in a memory for the story-pairs of the
source-identified stories because it provides the similarity measure of documents.

Sundaresan et al. as combined still does not teach the link indicating the story- 
pair are related to the same event. 

Maybury et al. does teach the link indicating the story-pair are related to the 
same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the Sundaresan et al. as modified by teachings of
Maybury et al. to include the link indicating the story-pair are related to the same event
(Maybury et al., paragraph 16, line 33). 

As per claim 40 Sundaresan et al. as modified is directed to the step of determining
inter-story similarity vectors comprising the steps of:

determining at least one inter-story similarity metric for each story-pair 
(Sundaresan et al., column 4, lines 9-25); 

and determining source-pair statistics for the story-pairs (Sundaresan et al., 
column 10, lines 15-17). 



As per claim 41 Sundaresan et al. as modified is directed to determining inter-story
similarity vectors further comprising the step of normalizing the inter-story similarity
metric based on the source-pair statistics (Sundaresan et al., column 10, lines 17-22).

As per claim 42 Sundaresan et al. as modified is directed to determining inter-story
similarity vectors further comprising the step of incrementally normalizing the inter-story
similarity metric based on the source-pair statistics (Sundaresan et al., column 10,
lines 16-22).

As per claim 43 Sundaresan et al. as modified is directed to the inter-story 
similarity metric is normalized based on at least one of subtraction and division 
(Sundaresan et al., column 8, lines 22-27).

As per claim 47 Sundaresan et al. as modified is directed to comprising the
step of transforming the source-identified training stories (column 1, line 63, wherein the 
"training stories" are in English). 

As per claim 48 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
linguistically transforming (Sundaresan et al., column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English therefore translation will not be necessary). 

As per claim 52 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic (Sundaresan et al., column 10, lines 15-17). 

As per claim 53 Sundaresan et al. as modified is directed to the at least one 
predictive model is at least one of: a classifier, a support vector machine, a decision
tree and a Naive-Bayes classifier (Sundaresan et al., column 8, lines 22-27).

As per claim 54 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristic of the new source (Sundaresan et al., column 3, lines 50-51).

As per claim 55 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one of a source characteristic (Sundaresan et al., 
column 3, lines 61-65, wherein "characteristic" means "leaf").

As per claim 56 Sundaresan et al. as modified is directed to the source
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
(Sundaresan et al., column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 

As per claim 57 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristic of the new source (Sundaresan et al., column 3, lines 50-53, wherein
each new source has different hierarchy). 

As per claim 58 Sundaresan et al. is directed to a linked event detection system
comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have 
memory); 

a processor that receives source-identified training stories via the input/output 
circuit (column 7, lines 34-35, wherein it is inherent for computer to have processor); 

and a link determining circuit that determines and displays on a computer or 
stores in an information repository, links between story-pairs based on a predictive
model in the memory and the inter-story similarity vectors (column 10, lines 5-13,
wherein sorting determines the link; column 6, lines 57-59).

Sundaresan et al. does not teach an inter-story similarity vector determining
circuit that determines inter-story similarity vectors in the memory for the story-pairs of
the source-identified stories.

Pirolli et al. teaches an inter-story similarity vector determining circuit that 
determines inter-story similarity vectors in the memory for the story-pairs of the source- 
identified stories (Pirolli et al., column 7, lines 53-65, wherein "pages" could mean 
"stories"). 

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
an inter-story similarity vector determining circuit that determines inter-story similarity 
vectors in the memory for the story-pairs of the source-identified stories because it 
provides the similarity measure of documents. 

Sundaresan et al. as combined still does not teach the link indicating the story- 
pair are related to the same event. 

Maybury et al. does teach the link indicating the story-pair are related to the 
same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the Sundaresan et al. as modified by teachings of
Maybury et al. to include the link indicating the story-pair are related to the same event
(Maybury et al., paragraph 16, line 33). 

As per claim 59 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the story-pairs (Sundaresan et al., column 4, lines 9-25); 

and a similarity statistics determining circuit that determines source-pair statistics 
for the story-pairs (column 10, lines 15-17). 

As per claim 60 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit normalizes the inter-story similarity metric based on 
the source-pair statistics (Sundaresan et al., column 10, lines 17-22).

As per claim 61 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit incrementally normalizes the inter-story similarity 
metric based on the source-pair statistics (Sundaresan et al., column 10, lines 16-22). 

As per claim 62 Sundaresan et al. as modified is directed to at least one of the 
inter-story similarity metrics is normalized based on at least one of a subtraction and a 
division operation (Sundaresan et al., column 8, lines 22-27).

As per claim 66 Sundaresan et al. as modified is directed to comprising the
step of transforming the source-identified training stories (Sundaresan et al., column 1, 
line 63, wherein the "training stories" are in English).

As per claim 67 Sundaresan et al. as modified is directed to transforming the
source-identified training stories is at least one of translating, transcribing and
linguistically transforming (Sundaresan et al., column 1, line 63; column 2, line 43,
wherein the HTML and XML are in English therefore translation will not be necessary). 

As per claim 71 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic (Sundaresan et al., column 10, lines 15-17). 

As per claim 72 Sundaresan et al. as modified is directed to the predictive model 
is at least one of: a classifier, a support vector machine, a decision tree and a
Naive-Bayes classifier (Sundaresan et al., column 8, lines 22-27).

As per claim 73 Sundaresan et al. as modified is directed to the source-pair 
identified similarity statistic is determined based on a source hierarchy (Sundaresan et
al., column 3, lines 50-51). 

As per claim 74 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one of a source characteristic (Sundaresan et al.,
column 3, lines 61-65, wherein "characteristic" means "leaf").

As per claim 75 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
(Sundaresan et al., column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other).

As per claim 76 Sundaresan et al. as modified is directed to the source-pair
similarity statistic for a new source is determined based on at least one source
characteristic of the new source (Sundaresan et al., column 3, lines 50-53, wherein
each new source has different hierarchy). 

As per claim 81 Sundaresan et al. is directed to a computer readable storage
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code processable to 
program a computer to determine at least one predictive model for a linked event 
detection system by executing steps comprising: 

determining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

determining link label information for the at least one story-pair (column 9, lines 
8-9); 

and determining and storing at least one predictive model in the memory based
on the inter-story similarity vector and the link label information (column 7, lines 24-25;
column 10, lines 5-13).

Sundaresan et al. does not teach determining inter-story similarity vectors in a
memory for at least one story-pair of the source-identified training stories.

Pirolli et al. teaches determining inter-story similarity vectors in a memory for at
least one story-pair of the source-identified training stories (Pirolli et al., column 7, lines 
53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
determining inter-story similarity vectors in a memory for at least one story-pair of the 
source-identified training stories because it provides the similarity measure of 
documents. 

Sundaresan et al. as combined still does not teach the link label information 
indicating training stories are related to the same event. 

Maybury et al. does teach the link label information indicating training stories are 
related to the same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine the Sundaresan et al. as modified by teachings of 
Maybury et al. to include the link label information indicating training stories are related 
to the same event (Maybury et al., paragraph 16, line 33). 

As per claim 82 Sundaresan et al. is directed to a computer readable storage
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code processable to 
program a computer to determine at least one predictive model for a linked event 
detection system, the computer readable program code comprising:

instructions to determine source-identified training stories (column 3, lines 16-17,
wherein "stories" means "documents"); 

instructions to determine link label information for the at least one story-pair 
(column 9, lines 8-9); 

and instructions to determine and store at least one predictive model in the 
memory based on the inter-story similarity vector and the link label information (column 
7, lines 24-25; column 10, lines 5-13).

Sundaresan et al. does not teach instructions to determine inter-story similarity 
vectors in memory for at least one story-pair of the source-identified training stories. 

Pirolli et al. teaches instructions to determine inter-story similarity vectors in 
memory for at least one story-pair of the source-identified training stories (Pirolli et al., 
column 7, lines 53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
instructions to determine inter-story similarity vectors in memory for at least one story- 
pair of the source-identified training stories because it provides the similarity measure of 
documents. 

Sundaresan et al. as combined still does not teach the link label information 
indicating training stories are related to the same event. 

Maybury et al. does teach the link label information indicating training stories are 
related to the same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine Sundaresan et al. as modified with the teachings of 
Maybury et al. to include the link label information indicating training stories are related 
to the same event (Maybury et al., paragraph 16, line 33). 

As per claim 83, Sundaresan et al. is directed to a computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code processable to 
program a computer to detect linked events by executing steps comprising: 

determining source-identified stories (column 3, lines 16-17, wherein "stories" 
means "documents"); 

determining at least one predictive model in the memory for link detection 
(column 9, lines 8-9); 

determining a link between story-pairs based on the at least one predictive model 
and the inter-story similarity vectors (column 10, lines 5-13); and 

displaying the link on a computer or storing the link in an information repository 
(column 6, lines 57-59). 

Sundaresan et al. does not teach determining inter-story similarity vectors in a 
memory for the at least one story-pair of the source-identified stories. 

Pirolli et al. teaches determining inter-story similarity vectors in a memory for the 
at least one story-pair of the source-identified stories (Pirolli et al., column 7, lines 
53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by the teachings of Pirolli et al. to 
include determining inter-story similarity vectors in a memory for the at least one 
story-pair of the source-identified stories because it provides the similarity measure 
of documents. 

Sundaresan et al. as combined still does not teach the link indicating the story- 
pairs are related to the same event. 

Maybury et al. does teach the link indicating the story-pairs are related to the 
same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine Sundaresan et al. as modified with the teachings of 
Maybury et al. to include the link indicating the story-pairs are related to the same event 
(Maybury et al., paragraph 16, line 33). 

As per claim 84, Sundaresan et al. is directed to a computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code executable to program 
a computer to detect linked events comprising the steps of: 

instructions to determine source-identified stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

instructions to determine at least one predictive model in a memory for link 
detection (column 9, lines 8-9); 

instructions to determine a link between story-pairs based on the predictive 
model and the inter-story similarity vectors (column 10, lines 5-13); and 

instructions displaying the link on a computer or storing the link in an information 
repository (column 6, lines 57-59). 

Sundaresan et al. does not teach instructions to determine inter-story similarity 
vectors in a memory for the at least one story-pair of the source-identified stories. 

Pirolli et al. teaches instructions to determine inter-story similarity vectors in a 
memory for the at least one story-pair of the source-identified stories (Pirolli et al., 
column 7, lines 53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by the teachings of Pirolli et al. to include 
instructions to determine inter-story similarity vectors in a memory for the at least one 
story-pair of the source-identified stories because it provides the similarity measure of 
documents. 

Sundaresan et al. as combined still does not teach the link indicating the 
story-pairs are related to the same event. 

Maybury et al. does teach the link indicating the story-pairs are related to the 
same event (paragraph 16, lines 31-33). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to combine Sundaresan et al. as modified with the teachings of 
Maybury et al. to include the link indicating the story-pairs are related to the same event 
(Maybury et al., paragraph 16, line 33). 

As per claims 85 and 86 Sundaresan et al. as modified is directed to determining 
at least one source-pair statistic for the at least one story-pair is based on at least one 
of a similarity metric and a statistic associated with the metric (Sundaresan et al., 
column 3, lines 25-29, wherein the statistical algorithm uses metric for the 
computations). 

As per claims 87 and 88 Sundaresan et al. as modified is directed to at least one 
of the predictive models is a trained predictive model (Sundaresan et al., column 10, 
lines 29-33, wherein the "trained predictive model" is determined by use of statistical 
model). 

7. Claims 6-8, 25-27, 44-46, and 63-65 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) in view of Pirolli et 
al. (US 5,835,905) and further in view of Gange et al. (US 2004/006559 A1) and further 
in view of Maybury et al. (US 6,961,954 B1). 

As per claims 6, 25, 44 and 63, Sundaresan et al. as modified fails to teach the 
use of a probability based metric and a Euclidean based similarity metric. 

Gange et al. teaches the use of Euclidean distance (Gange et al., page 3, 
paragraph 0045). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Gange et al. to include the use of Euclidean distance, as it is a metric often used in the 
database field to compute distances between similar terms. 

As per claims 7, 26, 45 and 64, Sundaresan et al. as modified fails to teach that 
the similarity metric is at least one of a Hellinger, a Tanimoto and a clarity distance 
based metric. Gange et al. teaches the use of the Tanimoto coefficient (Gange et al., 
page 3, paragraph 0045). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Gange et al. to include the use of the Tanimoto coefficient, as it is a metric often used in 
the database field to compute distances between similar terms. 

As per claims 8, 27, 46 and 65, Sundaresan et al. as modified fails to teach that 
the inter-story similarity metric is a cosine-distance based metric. Gange et al. teaches 
the use of the Cosine coefficient (Gange et al., page 3, paragraph 0045). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Gange et al. to include the use of the Cosine coefficient, as it is a metric often used in 
the database field to compute distances between similar terms. 
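
For reference, the three measures named in the rejections of claims 6-8 can be 
sketched as follows. This is an illustrative sketch only, using the standard textbook 
definitions; the function names and the toy term-count vectors are assumptions and are 
not taken from Gange et al. or from the application. 

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length term-count vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_coefficient(a, b):
    """Cosine of the angle between two vectors; 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def tanimoto_coefficient(a, b):
    """Tanimoto (extended Jaccard) coefficient: a.b / (|a|^2 + |b|^2 - a.b)."""
    dot = sum(x * y for x, y in zip(a, b))
    denom = sum(x * x for x in a) + sum(y * y for y in b) - dot
    return dot / denom if denom else 0.0


# Hypothetical term counts for two stories over a three-term vocabulary.
story_a = [1, 0, 1]
story_b = [1, 1, 0]
print(euclidean_distance(story_a, story_b))
print(cosine_coefficient(story_a, story_b))
print(tanimoto_coefficient(story_a, story_b))
```

Note that Euclidean distance is smaller for more similar stories, while the cosine and 
Tanimoto coefficients are larger, so the three are not interchangeable without 
adjusting the comparison. 
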

8. Claims 11-13, 30-32, 49-51 and 68-70 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) in view of Pirolli et 
al. (US 5,835,905) and in further view of Zhou (US 2004/0002849 A1) and further in view 
of Maybury et al. (US 6,961,954 B1). 

As per claims 11, 30, 49 and 68, Sundaresan et al. as modified fails to teach that the 
inter-story similarity metrics are based on terms in at least one source-identified term 
frequency-inverse story frequency model. Zhou teaches the use of frequency-inverse 
(Zhou, page 3, column 2, paragraph 0030, lines 9-11). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Zhou to include the use of frequency-inverse because it provides an effective example of 
sentence retrieval, as stated on page 1, column 1, paragraph 0005 of Zhou. 
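
For reference, a term frequency-inverse story frequency weighting of the kind recited 
in claims 11, 30, 49 and 68 can be sketched as follows. The weighting shown, 
tf * log(N / sf), is one common formulation and is an assumption here; it is not quoted 
from Zhou or from the application, and the function name and sample stories are 
hypothetical. 

```python
import math
from collections import Counter


def tf_isf_vectors(stories):
    """Weight each term by its count in a story times log(N / story frequency).

    `stories` is a list of token lists; returns one {term: weight} dict per story.
    """
    n = len(stories)
    # Story frequency: in how many stories each term appears at least once.
    story_freq = Counter()
    for tokens in stories:
        story_freq.update(set(tokens))
    vectors = []
    for tokens in stories:
        counts = Counter(tokens)
        vectors.append(
            {term: count * math.log(n / story_freq[term])
             for term, count in counts.items()}
        )
    return vectors


# Hypothetical two-story corpus.
corpus = [["quake", "hits", "city"], ["quake", "relief"]]
for vector in tf_isf_vectors(corpus):
    print(vector)
```

A term that appears in every story receives a weight of zero under this formulation, 
so only terms that distinguish one story from another contribute to a similarity 
computed over these vectors. 
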

As per claims 12, 31, 50 and 69, Sundaresan et al. as modified fails to teach that the 
terms in source-identified term frequency-inverse story frequency models are based on 
language. 

Zhou teaches that the retrieved samples are to aid in writing or translation (Zhou, 
page 3, paragraph 0030, lines 2-4, wherein writing or translating has basis in language). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Zhou to include the inverse-frequency based on language because term comparison 
includes terms of a language. 

As per claims 13, 32, 51 and 70, Sundaresan et al. as modified fails to teach that 
determining terms comprises the steps of: determining a reference language; and 
determining reference language and non-reference language terms. 

Zhou teaches the changing of sample terms from one mode to another (Zhou, 
page 3, paragraph 0032). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by the teachings of 
Zhou to include the determination of a reference language, since the correct translation 
requires the correct reference language. 

9. Claims 77-80 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Wayne (Topic Detection and Tracking in English and Chinese, Charles Wayne, 
Published in 2000) in view of Wu et al. (US Publication 2005/0289463, filed June 23, 
2004). 

As per claim 77, Wayne teaches a method of determining a stop word list 
comprising the steps of: 

determining a source-identified training corpus of text information (pg. 166, section 3, 
and table 1, sources); 

determining a verified first source-mode transformation of the source-identified 
training corpus text from a first mode to a second mode based on at least one 
of a verified transcription and a verified translation (pg. 166-167, sections 3.2-3.5, 
verified translations and transcriptions of the source corpora); 

determining an un-verified second source-mode transformation of the source- 
identified training corpus text from a first mode to a second mode (pg. 166-167, sections 
3.2-3.5, verified translations and transcriptions of the source corpora, specifically audio 
transcription to manual closed caption quality, but not verified as stated in the 
specification); 

determining at least one transformation error associated with distribution 
differences between the first and second transformations and identified 
sources (Figure 5, determining the transformation errors, and distribution differences 
between Chinese audio/text and English audio/text); 

Wayne does not explicitly disclose determining and storing at least one source- 
specific transformation action for the determined transformation errors in a memory and 
identifying and transforming transformation errors in other transformed source-identified 
texts based on the source-specific transformation actions in the memory. 

Wu discloses determining and storing at least one source-specific transformation 
action for the determined transformation errors in a memory ([0021], using a set of 
inputs to train the transformation rules); and identifying and transforming transformation 
errors in other transformed source-identified texts based on the source-specific 
transformation actions in the memory ([0021], using the set of inputs to train the 
transformation rules allows the most common spelling errors and corrections to be 
determined). 

Wayne and Wu are analogous art because they are relevant to solving the same 
problem, transformation and translation errors in documents. It would have been 
obvious to one of ordinary skill in the art, at the time of the invention to modify the 
method of Wayne to include the auto-correction of Wu to allow documents to be auto- 
corrected. The suggestion/motivation to combine is that this process enhances the 
efficiency and effectiveness of the correction system (Wu, [0021]). Therefore it would 
have been obvious to combine the above references to obtain the instant invention. 

As per claim 78, Wayne as modified is directed to the first mode being at least one of 
a text source, an optical character recognition source and an automatic speech 
recognition source (pg. 166, section 3.2). 

As per claim 79, Wayne as modified is directed to the second mode being at least 
one of a text source, an optical character recognition source and an automatic speech 
recognition source (pg. 166, section 3.2). 

10. Claim 80 is rejected under 35 U.S.C. 103(a) as being unpatentable over Wayne, 
in view of Wu, and in further view of Brown (Dynamic Stopwording for Story Link 
Detection, Ralph Brown). 

As per claim 80, the combination of Wayne and Wu does not disclose wherein the 
source-specific transformation is at least one of a removal, a repair and a normalization 
transformation. Brown is directed to wherein the source-specific transformation is at 
least one of a removal, a repair and a normalization transformation (Brown, page 2, 
column 1, lines 4-6). 

Wayne, Wu and Brown are analogous art because they are relevant to solving 
the same problem, transformation and translation errors in documents. It would have 
been obvious to one of ordinary skill in the art, at the time of the invention to modify the 
method of Wayne and Wu to include the transformation of Brown to allow one of the 
above transformation methods to be used. The suggestion/motivation to combine is to 
allow efficient removal of stopwords (Brown, page 2, column 1, lines 4-6). Therefore it 
would have been obvious to combine the above references to obtain the instant 
invention. 

Response to Arguments 

11. Applicant's arguments filed 1/21/2008 have been fully considered but they are 
not persuasive. Applicant's arguments with respect to claim 77 are considered moot 
in light of the new grounds of rejection. 

12. In response to applicant's arguments against the references individually, one 
cannot show nonobviousness by attacking references individually where the rejections 
are based on combinations of references. See In re Keller, 642 F.2d 413, 208 
USPQ 871 (CCPA 1981); In re Merck & Co., 800 F.2d 1091, 231 USPQ 375 (Fed. Cir. 
1986). 

Applicant's argument that Sundaresan et al. does not teach the source of the 
identified stories is not persuasive. Sundaresan et al. teaches in column 6, lines 
48-49 that web pages are identified by URL and come from web sites (column 6, lines 
65-68) that are associated with a particular internet domain name and include the 
content of a particular organization. 

Applicant's argument that Pirolli et al. does not teach determining the vector as 
per the limitation of claim 1 is not persuasive. The examiner combined the 
Sundaresan et al. and Pirolli et al. references, wherein Sundaresan et al. teaches the 
source (as pointed out in the previous argument) and the particular organization could 
be CNN, NBC, etc. Therefore the broadest interpretation of the limitation is covered by 
the combined references. 

Applicant's argument that Sundaresan et al. does not teach determining link 
label information is not persuasive. Classifying documents into categories 
means that the documents in a particular directory are linked by similarity of terms or 
concepts, etc. 

Applicant's argument that Maybury does not teach indicating the existence of 
stories linked to the same event is not persuasive. Maybury teaches a 
system that finds interrelated stories using segmentation in column 19, lines 33-38. 



Conclusion 



13. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. Hauptmann, Topic Detection of Multilingual Broadcast News in 
the Informedia Digital Video Library; Cieri, Large, Multilingual, Broadcast News Corpora 
for Cooperative Research in Topic Detection and Tracking. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to JEFFREY A. BURKE whose telephone number is 
(571)270-3844. The examiner can normally be reached on M-R: 7:30 - 5. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, James Trujillo can be reached on 571-272-3677. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 